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We show that the load at each node in a preferential attachment network scales as a power of the 
degree of the node. For a network whose degree distribution is p{k) ~ k we show that the load 
is l{k) ~ k^ with ?7 = 7 — 1, implying that the probability distribution for the load is p{l) ~ l/l^ 
independent of 7. The results are obtained through scaling arguments supported by finite size scaling 
studies. They contradict earlier claims, but are in agreement with the exact solution for the special 
case of tree graphs. Results are also presented for real communications networks at the IP layer, 
using the latest available data. Our analysis of the data shows relatively poor power-law degree 
distributions as compared to the scaling of the load versus degree. This emphasizes the importance 
of the load in network analysis. 

PACS numbers: 



A variety of problems in fields ranging from the social 
sciences to biology to engineering deal with networks, for 
which a unified understanding has been sought [H [21 [3] . 
One of the models commonly used is the preferential at- 
tachment (PA) model due to Barabasi and Albert ^ and 
its generalizations [U [51 17] . This model generates scale- 
free networks, in which the probability for a node to have 
a degree k scales as p{k) ^ . Depending on the pa- 
rameters of the model, 7 can be varied continuously over 
the range 2 < 7 < 3. It has been argued that this model 
is appropriate to describe communications networks such 
as the Internet. 

Among the various properties of PA networks that have 
been studied is the distribution of load (with uniform de- 
mand) at different nodes of the network. This is defined 
by assuming that one unit of traffic flows between each 
pair of nodes in the network along the shortest path con- 
necting them[5]. (If there are multiple shortest paths 
between a pair of nodes, the traffic between them is di- 
vided equally among all the shortest paths [3].) In this 
setting, the amount of traffic flowing through a node is its 
load. Based on numerical simulations [lOj , it was claimed 
that the probability distribution for the load scales as 
p{l) ^ l/l^ with S — 2.2. Subsequently, data for net- 
works in various different fields were presented, and it 
was argued that [TT] there are two universality classes 
with 6 — 2 and 6 = 2.2. These claims were disputed [12] 
on the basis further numerical simulations, which seemed 
to indicate that S varies continuously with 7 and is there- 
fore not universal. For the special case of PA networks 
that are trees, it was argued [13] and then proved [14] 
that (5 = 2. 

In this paper, we show that the average load at nodes 
of degree k scales as l{k) with 77 = 7 — 1 for the 

PA model, regardless of 7. (As k is increased for fixed 
N, finite size effects are seen.) If we assume that the 
distribution of load for fixed k and N does not have an 
anomalously large width, this implies that the exponent 
5 is universal, but is equal to 2, contradicting the earlier 



claims [TOl [HI [12] , and extending the exact result for PA 
trees. We also extend the analytical proof of Ref. [1|] 
for tree graphs to show directly that 77 = 2, supporting 
the assumption that the distribution of I for fixed k is 
not anomalous. Our results are obtained by simple scal- 
ing arguments that arc reinforced by finite size scaling 
studies. The deviations from this universal result that 
are observed [TUl [H] are due to finite size scaling and 
subleading corrections to the asymptotic scaling form. 

We also show results for load analyses on networks 
drawn from a recent database |15j of connectivity of com- 
munications networks at the IP layer. The data are 
collected with new measurement techniques, and find 
many more routers and links than earlier studies [T5] . 
The results demonstrate that the scaling of the load 
l{k) ~ k"^ is much clearer than that of the — more com- 
monly studied — degree distribution. 

In the generalized PA model, a network grows one node 
at a time. Each node is born with m undirected edges 
which are attached to preexisting nodes. The probability 
of attachment to a preexisting node of degree k is pro- 
portional to fc + fco- Thus /cq and m are the parameters 
of the model, with fco < —m- For an infinite network, it 
can be shown that the probability of a randomly chosen 
node having a degree k is to 

Pk (X k^^ ^ = 3 + ka/m (1) 

for large fc. For such a network, we assume — as verified 
later through numerical simulations — that the average 
load lN{k) at all the nodes of degree fc in a network of N 
nodes has the scaling form 

/^(fc) = Nk'^iik/Nt') (2) 

where l{x) ^ 1 as x ^ and l{x) — + as a; — )■ 00. 
The prefactor of N is reasonable, since most of the load 
at nodes near the periphery of the network, for which 
fc ~ 0(1), is due to traffic that starts or ends there, 
and is therefore 0{N). The exponent 77 can be found by 



2 



noting that J2ki-^PkVN{k) is the total traffic flowing in 
the network. Since iV^ units of traffic are generated in 
the graph, and the average geodesic length is ~ In N, the 
sum should scale as ~ iV^ In N for large N. From Eqs.Q 
and this implies that 



1. 



(3) 



To flnd the exponent fi, we note that for a network of 
N nodes, the maximum degree kmax that is achieved 
can be estimated by requiring (1 — Pk)^ to be 

~ From Eq. ([!]), for large kmax this is equivalent 

to exp[— A7V/fc^~J.] ~ 0(1) with some constant A, from 
which kmax ~ A^^/(^~i). If we assume that all character- 
istic fc's scale with N in the same way, we obtain 



M= 1/(7-1). 



(4) 



For k « Nf", Eq. 
combined with Eq. 



implies that lM{k) ^ Nk^. When 
, we have p{l)dl ~ dl/l^ with 



6^ 



7-1 



1 = 2 



(5) 



where we have used Eq.([3]). 

Although these results are plausible, they are based 
on assumptions, most notably the scaling hypothesis of 
Eq.([2| itself. To check these assumptions, we turn to fi- 
nite size scaling numerical simulations. Networks with N 
ranging from 500 to 8000 or 16000 were generated for dif- 
ferent values of m and /cq. We considered the cases fcg = 
with m = 1,2,4,6 and m — 6 with ko = 1,2,3,4,5. For 
each choice of fco, to and A^, 10000 graphs were generated, 
with traffic flowing as described above. lN{k) was calcu- 
lated by averaging over all the nodes of degree k in the 
100 graphs. For the plots, it is more convenient to use 
the form lN{k) = NH{k/N'') instead of F,q.^, which 
one obtains if one uses Eqs.Q and ^. 

Figure [T] shows the results for to = 1 and fcp = 0, for 
which 11—1/2 from Eq.Q. The scaling collapse is rea- 
sonable, but the best fit straight line is in fact l{k) ^ 
consistent with earlier results |13j . In view of the analyt- 
ical results for PA tree graphs, this discrepancy must be 
attributed to finite size scaling effects which flatten the 
curve for large k and presumably reduce the apparent 
value of 77. The same discrepancy is seen for fco = with 
TO = 2,4,6; it is reasonable to attribute it to the same 
cause. 

Figure [2] shows a similar scaling plot for to = 6 and 
fco = 3, for which /i = 2/3. The scaling collapse and the 
fit to l{k) fc^.5 are both very good in this case. Figure[3] 
is a similar plot for to = 6 and ko = 5. The scaling 
collapse is again very good, but finite size corrections for 
large k now increase the slope of the curve. Thus the fit 
to the predicted form of ~ k"^^^ only works when k « 
A^6/7 but k >> 0(1). ( Unless the scaling hypothesis 
breaks down, Eq.(|3| follows from the In A factor in the 
total load. Therefore, we do not try a scaling plot with 
adjustable exponents..) We have also made similar plots 
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FIG. 1; Average traffic at nodes of degree fe as a function of 
k for the PA model with m = l,ko =0. (Here and in the 
subsequent figures, A^'/SOOO is used to scale both axes instead 
of A'^.) A scaling collapse with the exponents from Eqs.(|3| and 
Q works reasonably well. However, a straight line with the 
predicted slope of 2.0 is shown and only fits the curve — if at 
all — for small k. Similar deviations are seen for m = 2, 4, 6 
with fco = 0. 
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FIG. 2: Scaling plot of the average traffic at nodes of degree 
fc as a function of k for the PA model with m — 6, fco — 3. 
The scaling collapse is very good, as is the fit to ~ k^^^ in the 
scaling regime. 



for TO = 6 and ko — 1,2 and 4. For ko < 3, finite size 
corrections reduce the apparent 77 for large fc, while for 
fcp > 3, they increase the apparent rj for large k. This is 
consistent with Ref. |T2]. 

For the tree graphs generated by the PA model with 
TO = 1,/co = Oj the result l{k) ~ fc^ follows from p{k) ~ 
1/fc^ and p{l) ^ 1/P if we assume that l{k) scales as a 
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T<Ti<...Tfc 



(6) 



where 



P{T,T',m) = 1- 




2t 



1- 



2r' - 2 



(7) 

Replacing P{t,t' ,m) as the exponential of an integral 
instead of a sum in the approximation r >> 1, we have 
P{T,T',m) = {T/T'y^/^. With this, EqM simplifies to 



N N I 



r 1 



(8) 



where we have used the symmetry of the r^'s to eliminate 
the restriction ti < r2 . . . < . If the sums are replaced 
with integrals, pk+i n{t) = \/t/N[1 - ^/t/N]''^^^ and 
Pk = {l/N)j:^Pk,N{r) ~ l/[fc(fc+l)(fc + 2)] for large N. 

If ni , n2 , . . . nfc are the sizes of the k subtrees descend- 
ing from a node of degree k+1, one can show that for 
large N the load at the node is proportional to N^rii. 
If ni{t) is the size of the i'th subtree at time t > t^, with 
ni{N) = Hi, the probability that ni{t + 1) = ni{t) + 1 



is {2ni — l)/{2t), with the initial condition ni{t) = 1. at 
time t. Averaging over randomness for fixed r^, the solu- 
tion is (rii(i)) = {t/Ti + l)/2. With the symmetrization 
of the previous paragraph. 
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{k+l;T) cx N{J2n^) = Nk{r 



kN 
1 - X 



1 + xf 



-dxi 
(9) 

where Ti = Nxf,T = Nx'^ and we have replaced sums 
with integrals. This yields 



FIG. 3: A plot similar to Figure [2] but with fco = 5. For 
k ~ 0(1), the individual curves pull away from the scaling 
form. For k ~ 0{N^'''), finite size effects cause the curves 
to bend upwards. Between these two regimes, the slope is 
consistent with ~ k''^^ as predicted. 



power of k and that the distribution of / for fixed k is 
not anomalously broad. Although these are reasonable 
assumptions, it is not diSicult to prove l{k) ~ fc^ directly. 
The probability that the node created at time r will be 
attached to a preexisting node of degree k is equal to 
fc/(2r — 2). Therefore the probability that a node created 
at time r will have exactly k nodes subsequently attached 
to it is 

^ 1 
Pu+i,N{r)= E ^(^+1' ^1-1' 1)^,7 ^P(Ti-l-l,r2-l,2 



lN{k + 1) oc Nk-^ / x(l - xr[2k{l + x) + x]dx cx iVr 
Jo 

(10) 

where we have only kept the terms that are relevant for 
iV >> fc >> 1. The terms dropped with the fc >> 1 
approximation are corrections to asymptotic scaling and 
cause the imperfect collapse in Figure [l] . 
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FIG. 4: Log log plot of N/(kl{k)) as a function of k for the 
Sprintlink network [TS]. The dashed line is visually adjusted 
for best fit, and has a slope of —2.3 corresponding to l{k) ~ k^ 
with -q = 1.3. The plot also shows the degree distribution p{k) 
and the cumulative distribution P{k) = The inset 

shows the cumulative load distribution P(l) — J^°° p{li)dli 
and a straight line with slope -1. 



We now compare with network data from the Rocket- 
fuel database [15 , which is the most recent, comprehen- 
sive and publicly available collection of measurements of 
the connectivity between nodes of communications net- 
works at the IP layer. There are ten networks with 
121 to 10214 routing nodes (from here on referred to as 
"routers") in the database. Since the data is insufficient 
to test scaling functions (in our simulations we worked 
with 100 networks for each N, with 500 < TV <~ 10000), 
we only consider how the traffic at nodes scales with their 
degree without regard to any finite size cutoff. Figure |4] 
shows the results for one of the larger networks in the 
database, the Sprintlink network with 8355 routers. The 
load as a function of degree fits quite well to a power 
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FIG. 5: Plot similar to Figure |4] but with results from all ten 
networks in the database combined. 



law with a slope of 1.3. The degree distribution itself is 
much more irregular, and it is difficult to say whether 
it is of the form ^ k~'' with 7 = ry + 1. The cumula- 
tive degree distribution is also shown in the figure; it 
is much smoother, but does not show clear power law 



behavior. The inset shows the cumulative load distribu- 
tion and a straight line with the predicted slope of -1, 
which is equally unconvincing. Figure |5] is a similar fig- 
ure with all ten networks in the database merged. The 
plots are straighter, and one could perhaps argue for a 
narrow power law regime in the cumulative degree distri- 
bution as is done in Ref. |15j or a regime with slope — 1 
in the inset. However, the load versus degree has a much 
closer linear behavior in the log-log plot, and is there- 
fore a more natural demonstration of power-law scaling 
in communications networks than the more commonly 
studied degree distribution. 

In conclusion, we have shown that for a preferential 
attachment model with degree distribution p{k) ^ fc"'*', 
the average traffic as a function of node degree scales as 
l{k) ~ k''^~^. This is equivalent to the statement that the 
probability distribution for the load scales as p{l) ^ 1 /P 
regardless of 7. Although the numerical simulations and 
analytical calculations are for a specific model, the result 
follows from the scaling assumption and the small-world 
phenomenon and is therefore more robust. The scaling 
l{k) ~ k'^ is also seen clearly in networks at the IP layer, 
and is in fact much better than the degree distribution 
which has attracted much more interest. 
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